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<400> 1 

gccaggcgga 

tcatcaatgc 

ctcgtggctc 

cttgcgcgct 

gcctgggcga 

tgaccattgc 

acatcatggg 

ggattgacca 

gcccccgctc 

ccgctattga 

ccaacgactg 

gccagttcac 

tgtgggagga 

tctcggacgg 

ccccgctgac 

ccgacaagct 

gcggtgtgga 

tggacattga 

acagcgt eta 

ctgtggaccc 

tggacatcat 

tcctgggtac 

agggccgcgc 

gcgccgactt 

ccatgcacta 

aggagggegt 

acgccgacgc 

accccgagat 

agtgggaggg 

aggaggagat 

acgcccccaa 

ecaaggtegg 

cctcgcccgc 

ccaagcccaa 

agaaeggege 



gcgtatggct 
cgcgtcgttc 
cgcacgcaag 
ggacatcgtg 
tgtgactggt 
ccctcgctac 
cgagaaggtc 
cccctggttc 
cggcgctgac 
ggctgcccgc 
gcactccgcc 
caaggecaag 
ggctttcaag 
etatgecaag 
gggaaagacc 
ggtgactgtg 
gctggacacc 
ggagtggaac 
cgccggcaag 
caccgccccc 
cctggccgcc 
cggcaaggcc 
caagggegtg 
catgctggtg 
eggtacegtg 
caccggcttc 
cctggccgcc 
ggtggccaac 
ectgetggag 
caaggtgccc 
caccctgaag 
caccaccgcc 
cgccgccacc 
gaccgctggc 
tgcctccaac 



gttgectcta 
ggtgtcaaga 
tccacctcgc 
atggttgctg 
ggcctgccta 
gaccagtacg 
cgctacttcc 
ctggccaagg 
tacctggaca 
gtgctgccct 
ctggtgcccg 
tcggtgctgg 
gacacgaagc 
gtttacactg 
tacaagaaga 
tcgcccaact 
gtcatccgcg 
cccaagaccg 
gccgccgcca 
ctgttcgcct 
ctgcccaaga 
gectacgaga 
gtcaagttct 
ccctcgcgct 
cccgtggtag 
cacatgggcg 
accgtgcgcc 
tgcatcagcc 
gaggtggtgt 
gttgecgaga 
cccgtgtccg 
cccgccatgg 
cccaaggtga 
ctcaagctgg 
ggcaaeggea 



ccagccgccc 
agaccgegaa 
geteggctgt 
ctgaggtege 
ttgagctggt 
ctgacgcctg 
actccatcaa 
tctggggcaa 
accacaagcg 
tcggccccgg 
tectgetgaa 
ctatccacaa 
tgccccaggc 
aggccacccc 
tcaactggct 
acgcgaccga 
ccaagggcat 
acaagttcct 
aggaggcect 
teat eggecg 
tcctggccac 
agctggtgaa 
cggcgcccct 
tcgagccctg 
cctccaccgg 
ccctgaaccc 
gtgecagega 
aggacctgtc 
aeggcaaggg 
agatccccgg 
cctccgtgga 
gcgcgtggcg 
ccapctacaa 
ccggtgaggc 
acggtgcctc 



cagcagcgcg 
ccagctgctg 
tactggtgcc 
cccttggtcc 
caagcgcggc 
ggacacctcg 
gaagggcgtg 
gaccggctcc 
cttcgccctg 
egaggactge 
ggacgagtac 
catcgccttc 
cgcctttgac 
catggaggag 
gaagggtggc 
gatcgctgcc 
tgagggcatt 
gtctgcgccc 
geaggecgag 
cctggaggag 
ccccaaggtg 
cgccatcggc 
ggcgcacatg 
cggcctgatc 
cggcctggtc 
cgacaagctg 
ggtgtttgcg 
ctggtccaag 
cggcgtggcc 
cgacctgccc 
gggcaaegge 
cgcgaccacc 
gcccgccctg 
ctccaccacc 
ggcctccaag 



cgtcctatcg 
cgtgagctt-g 
actggtgcca 
aagacgggcg 
caccgcgtca 
gtggtcgtgg 
caccgcgtgt 
aagctgtac.g 
ttctgeaagg 
gtctt cgtgg 
cagcccaagg 
cagggccgca 
aagctggcct 
gacgagaagc 
attatcgecg 
gatgccgccg 
( gtgaaeggea 
1 tacgaccaga 
ctgggcctgc 
cagaagggtg 
cagatcgcca 
accaagtaca 
ctcaccgccg 
cagctgcacg 
gacaccgtca 
gacgaggctg 
ggcggccgct 
cccgcccaga 
accgccaaga 
gccgtgtcct 
gccgccgcgc 
ccctcgggcc 
cccgccaccg 
tcgacctcgg 
acctcggctg 



60 

120 

180 

240 ■ 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 



ccaagcccct 
gcggcgacag 
gcggccgcca 
gagcatggca 
tgaagtcggt 
gtttcggttt 
tgcgcggcgc 
tcggcggcaa 
gagagtttgc 
cctccggcaa 
ggatacataa 
ggttttgttg 
cttgggctca 
ggct cgaagc 
cttgactgga 
acccagtagt 
tgagagagta 



ggtctccgcc 

catgagcggc 

ccggcgagga 

ggcggaaacg . 

tttgcgagac 

tgcccaagtc 

ggcgaccacg 

gagggcgggt 

cattttgtgg 

cggagaagtg 

gagcgtgtgg 

aggtgcagac 

tttttggtgc 

gagggtttgt 

ggtacacaaa 

gtggaaagta 

acaaagatgg 



gccacccgca 
tcgaccaaag 
gcaggcttgc 
tgtgtactga 
cggagaaacg 
caaaagaaga 
ccggcccgtg 
ctgtacactc 
gacgtgagcg 
ggaggcgctg 
aatggtggta 
ggcaccgtcg 
ccgtcgatga 
ggaagtggag 
gtggagcgtg 
gttggacctg 
cacctgaaaa 



agtccgccta 
ctgtggcagg 
ggcagcgagg 
aatgtggtgc 
ccggtttggt 
gtaacgaaac 
cgcagcctgt 
catccattcc 
gcgggacggc 
tagcccggtg 
aaagaggagg 
gcgtcaaagg 
tgagagattg 
cgaggagggt 
cgacggcacg 
aattctttga 
aaaaaaaaaa 



aagcggcagt 
aacggctgta 
gcgatgagct 
atgagagtgt 
tttgtagtgc 
tgtagcagta 
cctgccctca 
aggatttttg 
cgcgccgggc 
accccccaat 
ggcctgggtc 
ccctcgcaag 
gccagcggtt 
tggagaaaga 
gaggcattgg 
gagtaccgcg 
aaaaaaaaaa 



agccgcagag 
gcagcggcag 
tagcggccgt 
cgtgctgtaa 
agggcctgtg 
gcagagcact 
gccttgtgat 
caggctgcct 
tctcctaccg 
gtagaggatg 
gcccctcgat 
gcccgggtgc 
ttttgaggct 
ggcggacatg 
cggactattg 
cattaatccg 
aaaaaaa 



2160 
2220 
2280 
2340 
2 4 00 
2460 
'2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3117 



<210> 2 
<211> 2124 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: fragment 

of the complete sequence of cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (1) . . (2124) 



<400> 2 

atg get gtt gec tct acc age cgc ccc age age gcg cgt cct ate gtc 

Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro lie Val 

1 5 10 15 



48 



ate aat gee gcg teg ttc ggt gtc aag aag acc gcg aac cag ctg ctg 
lie Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 
20 25 30 



96 



cgt gag ctt get cgt ggc tec gca cgc aag tec acc teg cgc teg get 
Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Arg Ser Ala 
35 40 45 



144 



gtt act ggt gec act ggt gee act tgc gcg ctg gac ate gtg atg gtt 
Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp lie Val Met Val 
50 55 60 



192 



get get gag gtc gee cct tgg tec aag acg ggc ggc ctg ggc gat gtg 
Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 
65 70 75 80 



240 



act ggt ggc ctg cct att gag ctg gtc aag cgc ggc cac cgc gtc atg 
Thr Gly Gly Leu Pro lie Glu Leu Val Lys Arg Gly His Arg Val Met 
85 90 95 



288 



acc att gec cct cgc tac gac cag tac get gac gee tgg gac acc teg 336 
Thr lie Ala Pro Arg Tyr Asp Gin Tyr Ala Asp Ala Trp Asp Thr Ser 
100 105 110 



3 



gtg gtc gtg gac ate atg ggc gag aag gtc cgc tac ttc cac tec ate 384 

Val Val Val Asp lie Met Gly Glu Lys Val Arg Tyr Phe His Ser lie 

115 120 125 

aag aag ggc gtg cac cgc gtg tgg att gac cac ccc tgg ttc ctg gec .432 

Lys Lys Gly Val His Arg Val Trp lie Asp His Pro Trp Phe Leu Ala 

130 135 140 

aag gtc tgg ggc aag acc ggc tec aag ctg tac ggc ccc cgc tec ggc 480 

Lys Val Trp Gly Lys Thr Gly Ser Lys Leu Tyr Gly Pro Arg Ser Gly 

145 150 155 160 

get gac tac ctg gac aac cac aag cgc ttc gee ctg ttc tgc aag gec 528 

Ala Asp Tyr Leu Asp Asn His Lys Arg Phe Ala Leu Phe Cys Lys Ala 

165 170 175 

get att gag get ggc cgc gtg ctg ccc ttc ggc ccc ggc gag gac tgc 576 

Ala lie Glu Ala Ala Arg Val Leu Pro Phe Gly Pro Gly Glu Asp Cys 
180 185 190 

gtc ttc gtg gee aac gac tgg cac tec gec ctg gtg ccc gtc ctg ctg 624 

Val Phe Val Ala Asn Asp Trp His Ser Ala Leu Val Pro Val Leu Leu 

195 200 205 

aag gac gag tac cag ccc aag ggc cag ttc acc aag gee aag teg gtg 672 

Lys Asp Glu Tyr Gin Pro Lys Gly Gin Phe Thr Lys Ala Lys Ser Val 

210 215 220 

ctg get ate cac aac ate gec ttc cag ggc cgc atg tgg gag gag get 720 

Leu Ala lie His Asn lie Ala Phe Gin Gly Arg Met Trp Glu Glu Ala 

225 230 235 240 

ttc aag gac acg aag ctg ccc cag gee gee ttt gac aag ctg gee ttc 768 

Phe Lys Asp Thr Lys Leu Pro Gin Ala Ala Phe Asp Lys Leu Ala Phe 

245 250 255 

teg gac ggc tat gee aag gtt tac act gag gee acc ccc atg gag gag 816 

Ser Asp Gly Tyr Ala Lys Val Tyr Thr Glu Ala Thr Pro Met Glu Glu 
260 265 270 

gac gag aag ccc ccg ctg acg gga aag acc tac aag aag ate aac tgg 864 

Asp Glu Lys Pro Pro Leu Thr Gly Lys Thr Tyr Lys Lys lie Asn Trp 

275 280 285 

ctg aag ggt ggc att ate gee gee gac aag ctg gtg act gtg teg ccc 912 

Leu Lys Gly Gly lie lie Ala Ala Asp Lys Leu Val Thr Val Ser Pro 

290 295 300 

aac tac gcg acc gag ate get gec gat gee gec ggc ggt gtg gag ctg 960 

Asn Tyr Ala Thr Glu lie Ala Ala Asp Ala Ala Gly Gly Val Glu Leu 

305 310 315 4 320 

gac acc gtc ate cgc gee aag ggc att gag ggc att gtg aac ggc atg 1008 

Asp Thr Val lie Arg Ala Lys Gly lie Glu Gly lie Val Asn Gly Met 

325 330 335 

gac att gag gag tgg aac ccc aag acc gac aag ttc ctg tct gcg ccc 1056 

Asp lie Glu Glu Trp Asn Pro Lys Thr Asp Lys Phe Leu Ser Ala ; Pro 
340 345 350 



4 



tac gac cag aac age gtc tac gec ggc aag gec gec gec aag gag gec 1104 
Tyr Asp Gin Asn Ser Val Tyr Ala Gly Lys Ala Ala Ala Lys Glu Ala 
355 360 365 

ctg cag gec gag ctg ggc ctg cct gtg gac ccc acc gec ccc ctg ttc 1152 
Leu Gin Ala Glu Leu Gly Leu Pro Val Asp Pro Thr Ala Pro Leu Phe 
370 375 380 



gec ttc ate ggc cgc ctg gag gag cag aag ggt gtg gac ate ate ctg 
Ala Phe lie Gly Arg Leu Glu Glu Gin Lys Gly Val Asp lie lie Leu 
385 390 395 400 



1200 



gee gee ctg ccc aag ate ctg gec acc ccc aag gtg cag ate gee ate 
Ala Ala Leu Pro Lys lie Leu Ala Thr Pro Lys Val Gin lie Ala lie 
405 410 415 



1248 



ctg ggt acc ggc aag gee gee tac gag aag ctg gtg aac gee ate ggc 
Leu Gly Thr Gly Lys Ala Ala Tyr Glu Lys Leu Val Asn Ala lie Gly 
420 425 430 



1296 



acc aag tac aag ggc cgc gee aag ggc gtg gtc aag ttc teg gcg ccc 1344 

Thr Lys Tyr Lys Gly Arg Ala Lys Gly Val Val Lys Phe Ser Ala Pro 
435 440 445 

ctg gcg cac atg etc acc gec ggc gec gac ttc atg ctg gtg ccc teg 1392 

Leu Ala His Met Leu Thr Ala Gly Ala Asp Phe Met Leu Val Pro Ser 
450 455 460 

cgc ttc gag ccc tgc ggc ctg ate cag ctg cac gee atg cac tac ggt 1440 

Arg Phe Glu Pro Cys Gly Leu lie Gin Leu His Ala Met His Tyr Gly 

465 470 475 480 



acc gtg ccc gtg gta gec tec acc ggc ggc ctg gtc gac acc gtc aag 1488 
Thr Val Pro Val Val Ala Ser Thr Gly Gly Leu Val Asp Thr Val Lys 
485 490 495 

gag ggc gtc acc ggc ttc cac atg ggc gee ctg aac ccc gac aag ctg 1536 
Glu Gly Val Thr Gly Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu 
500 505 510 



gac gag get gac gec gac gee ctg gee gee acc gtg cgc cgt gec age 1584 
Asp Glu Ala Asp Ala Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser 
515 520 525 

gag gtg ttt gcg ggc ggc cgc tac ccc gag atg gtg gec aac tgc ate 1632 
Glu Val Phe Ala Gly Gly Arg Tyr Pro Glu Met Val Ala Asn Cys lie 
530 535 540 



age cag gac ctg tec tgg tec aag ccc 
Ser Gin Asp Leu Ser Trp Ser Lys Pro 
545 550 

ctg gag gag gtg gtg tac ggc aag ggc 
Leu Glu Glu Val Val Tyr Gly Lys Gly 
565 

gag gag ate aag gtg ccc gtt gee gag 
Glu Glu lie Lys Val Pro Val Ala Glu 
580 585 



gee cag aag tgg gag ggc ctg 1680 
Ala Gin Lys Trp Glu Gly Leu 
555 f 560 

ggc gtg gee acc gee aag aag 1728 
Gly Val Ala Thr Ala Lys Lys 
570 575 

aag ate ccc ggc gac ctg ccc 177 6 
Lys lie Pro Gly Asp Leu Pro 
590 



5 





gcc 


gtg 


tec 


tac 




J_ a 


Vdl 




Tyr 








595 






gag 


ggc 


aac 


ggc 




Ol LI 


oi y 










610 








atg 


ggc 


gcg 


tgg 




Met 


Gly 


Ala 


Trp 




625 










gcc 


acc 


ccc 


aag 




Ala 


1 nr 


Pro 


Lys 




aag 


CCC 


aag 


acc 




Lys 


Pro 


Lys 


i nr 










660 




teg 


acc 


teg 


gag 




Ser 


Thr 


Ser 


Glu 








675 






teg 


gcc 


tec 


aag 




Ser 


Ala 


Ser 


Lys 






690 






13 












cgc 


aag 


tec 


gcc 




Arg 


Lys 


Ser 


Ala 




705 









600 605 



615 620 



630 635 640 



645 650 655 



665 670 



680 685 



695 700 



!dk <210> 3 

<211> 708 
I <212> PRT 

5 <213> Artificial sequence 

rf <223> Description of the artificial sequence: fragment 

^ of the complete sequence of cDNA coding for the GBSSI 

of Chlamydomonas reinhardtii 

<400> 3 

Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro lie Val 
15 10 i; 15 

lie Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 
20 25 30 

Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Arg Ser Ala 
35 40 45 

Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp lie Val Met Val 
50 55 60 

Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 
65 70 75 80 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2124 



Thr Gly Gly Leu Pro lie Glu Leu Val Lys Arg Gly His Arg Val Met 
8 5 , 90 95- 



Thr lie Ala Pro 
100 

Val Val Val Asp 
115 

Lys Lys Gly Val 
130 

Lys Val Trp Gly 
145 

Ala Asp Tyr Leu 



Ala lie Glu Ala 
180 

Val Phe Val Ala 
195 

Lys Asp Glu Tyr 
210 

Leu Ala lie His 
225 

Phe Lys Asp Thr 



Ser Asp Gly Tyr 
260 

Asp Glu Lys Pro 
275 

Leu Lys Gly Gly 
290 

Asn Tyr Ala Thr 
305 

Asp Thr Val lie 



Asp lie Glu Glu 
340 

Tyr Asp Gin Asn 
355 

Leu Gin Ala Glu 
370 

Ala Phe lie Gly 
385 

Ala Ala Leu Pro 



Arg Tyr Asp Gin 



lie Met Gly Glu 
120 

His Arg Val Trp 
135 

Lys Thr Gly Ser 
150 

Asp Asn His Lys 
165 

Ala Arg Val Leu 



Asn Asp Trp His 
200 

Gin Pro Lys Gly 
215 

Asn lie Ala Phe 
230 

Lys Leu Pro Gin 
245 

Ala Lys Val Tyr 



Pro Leu Thr Gly 
280 

lie lie Ala Ala 
295 

Glu lie Ala Ala 
310 

Arg Ala Lys Gly 
325 

Trp Asn Pro Lys 



Ser Val Tyr Ala 
360 

Leu Gly Leu Pro 
375 

Arg Leu Glu Glu 
390 

Lys lie Leu Ala 
405 



Tyr Ala Asp Ala 
105 

Lys Val Arg Tyr 



lie Asp His Pro 
140 

Lys Leu Tyr Gly 
155 

Arg Phe Ala Leu 
170 

Pro Phe Gly Pro 
185 

Ser Ala Leu Val 



Gin Phe .Thr Lys 
220 

Gin Gly Arg Met 
235 

Ala Ala Phe Asp 
250 

Thr Glu Ala Thr 
265 

Lys Thr Tyr Lys 



Asp Lys Leu Val 
300 

Asp Ala Ala Gly 
315 

lie Glu Gly lie 
330 

Thr Asp Lys Phe 
345 

Gly Lys Ala Ala 



Val Asp Pro Thr 
380 

Gin Lys Gly Val 
395 

Thr Pro Lys Val 
.410 



Trp Asp Thr Ser 
110 

Phe His Ser lie 
125 

Trp Phe Leu Ala 



Pro Arg Ser Gly 
160 

Phe Cys Lys Ala 
175 

Gly Glu Asp Cys 
190 

Pro Val Leu Leu 
205 

Ala Lys Ser Val 



Trp Glu Glu Ala 
240 

Lys Leu Ala Phe 

255 

Pro Met Glu Glu 
270 

Lys lie Asn Trp 
285 

Thr Val Ser Pro 



Gly Val Glu Leu 
320 

Val Asn Gly Met 
• 335 

Leu Ser Ala Pro 
350 

Ala Lys Glu Ala 
365 ' 

Ala Pro Leu Phe 



Asp lie lie Leu 
400 

Gin lie Ala lie 
415 



7 



Leu Gly Thr Gly Lys Ala Ala Tyr Glu Lys Leu Val Asn Ala lie Gly 
420 425 430 

Thr Lys Tyr Lys Gly Arg Ala Lys Gly Val Val Lys Phe Ser Ala Pro 
435 440 445 

Leu Ala His Met Leu Thr Ala Gly Ala Asp Phe Met Leu Val Pro Ser 
450 455 460 

Arg Phe Glu Pro Cys Gly Leu lie Gin Leu His Ala Met His Tyr Gly 
465 470 475 480 

Thr Val Pro Val Val Ala Ser Thr Gly Gly Leu Val Asp Thr Val Lys 
485 490 495 

Glu Gly Val Thr Gly Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu 
500 505 510 

Asp Glu Ala Asp Ala Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser 
515 520 525 

Glu Val Phe Ala Gly Gly Arg Tyr Pro Glu Met Val Ala Asn Cys lie 
530 535 . 540 

Ser Gin Asp Leu Ser Trp Ser Lys Pro Ala Gin Lys Trp Glu Gly Leu 
545 550 555 560 

Leu Glu Glu Val Val Tyr Gly Lys Gly Gly Val Ala Thr Ala Lys Lys 
565 570 575 

Glu Glu lie Lys Val Pro Val Ala Glu Lys lie Pro Gly Asp Leu Pro 
580 585 590 

Ala Val Ser Tyr Ala Pro Asn Thr Leu Lys Pro Val Ser Ala Ser Val 
595 600 605 

Glu Gly Asn Gly Ala Ala Ala Pro Lys Val Gly Thr Thr Ala Pro Ala 
610 615 620 

Met Gly Ala Trp Arg Ala Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala 
625 630 635 640 

Ala Thr Pro Lys Val Thr Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala 
645 650 ' 655 

Lys Pro Lys Thr Ala Gly Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr 
660 665 670 

Ser Thr Ser Glu Asn Gly Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala 
675 680 685 



Ser Ala Ser Lys Thr Ser Ala Ala Lys Pro Leu Val Ser Ala Ala Thr 
690 695 700 



Arg Lys Ser Ala 
705 




8 



<210> 4 
<211> 1953 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI of 
Chlamydomonas reinhardtii and coding for the mature GBSSI 
protein . 

<220> 

<221> CDS 

<222> (1) . . (1953) 



<400> 4 

gcg ctg gac ate gtg atg gtt get get gag gtc gee cct tgg tec aag 48 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
1 5 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 
q Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 

jfi 20 25 30 

,S aag cgc ggc cac cgc gtc atg ace att gec cct cgc tac gac cag tac 144 

.12 Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 

H 35 40 -45 

get gac gec tgg gac ace teg gtg gtc gtg gac ate atg ggc gag aag 192 
. Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 

i 50 55 60 

□ 

j-i gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 

"j^ Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 

U * 65 70 75 80 

gac cac ccc tgg ttc ctg gec aag gtc tgg ggc aag ace ggc tec aag 288 
5^ Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 

85 90. 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gee ctg ttc tgc aag gee get att gag get gec cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gee aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gee ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc ace aag gec aag teg gtg ctg get ate cac aac ate gec ttc cag 528 
Phe Thr Lys Ala Lys Ser Va'l Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 



9 



ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gec 576 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

gec ttt gac aag ctg gec ttc teg gac ggc tat gec aag gtt tac act . 624 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

gag gec acc ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

acc tac aag aag ate aac tgg ctg aag ggt ggc att ate gec gee gac 720 
Thr Tyr Lys Lys -lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 " 230 ' 235 . 240 

aag ctg gtg act gtg teg ccc aac tac gcg acc gag ate get gee gat 768 
Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

^ gee gee ggc ggt gtg gag ctg gac acc gtc ate cgc gee aag ggc att 816 

;£ Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly He 

'5 260 265 270 

■ Ls 

9 a 9 ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 864 

U Glu Gly He Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 

•*J 275 280 285 

gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gee ggc 912 

* Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 

J«% 290 295 300 

j~ aag gee gee gee aag gag gee ctg cag gee gag ctg ggc ctg cct gtg 960 

\Z L ys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 

|3 305 310 315 320 

gac ccc acc gee ccc ctg ttc gee ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe He Gly Arg . Leu Glu Glu Gin 
325 330 335' 

aag ggt gtg gac ate ate ctg gee gec ctg ccc aag ate etg gec acc 1056 
Lys Gly Val Asp He He Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 356 

ccc aag gtg cag ate gee ate ctg ggt acc ggc aag gee gee tac gag 1104 
Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

aag ctg gtg aac gee ate ggc acc aag tac aag ggc cgc gee aag ggc 1152 
Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

gtg gtc aag ttc teg gcg ccc ctg gcg cac atg etc acc gee ggc gee 1200 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

gac ttc atg ctg gtg ccc teg cgc ttc gag ccc tgc ggc ctg ate cag 1248 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu He' Gin 
405 410 415 
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ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 

420 425 430 

ggc ctg gtc gac acc gtc aag gag ggc gtc acc ggc ttc cac atg ggc _ 1344 

Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 

435 440 445 

gcc ctg aac ccc gac aag ctg gac gag get gac gcc gac gcc ctg gcc 1392 

Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 

450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 1440 

Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 

465 "- 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tec tgg tec aag ccc 1488 

Glu Met Val Ala Asn Cys lie Ser Gin Asp Leu Ser Trp Ser Lys Pro 

485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 

Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 

500 505 510 

ggc gtg gcc acc gcc aag aag gag gag ate aag gtg ccc gtt gcc gag 1584 

Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val Pro Val Ala Glu 

515 520 525 

aag ate ccc ggc gac ctg ccc gcc gtg tec tac gcc ccc aac acc ctg 1632 

Lys lie Pro Gly Asp Leu Pro Ala Val Ser Tyr Ala Pro Asn Thr Leu 

530 535 540 

aag ccc gtg tec gcc tec gtg gag ggc aac ggc gcc gcc gcg ccc aag 1680 

Lys Pro Val Ser Ala Ser Val Glu Gly Asn Gly Ala Ala Ala Pro Lys 

545 550 555 560 

gtc ggc acc acc gcc ccc gcc atg ggc gcg tgg cgc gcg acc acc ccc 1728 

Val Gly Thr Thr Ala Pro Ala Met Gly Ala Trp Arg Ala Thr Thr Pro 

565 570 575 

teg ggc ccc teg ccc gcc gcc gcc acc ccc aag gtg acc acc tac aag 1776 

Ser Gly Pro Ser Pro Ala Ala Ala Thr Pro Lys Val Thr Thr Tyr Lys 

580 585 590 

ccc gcc ctg ccc gcc acc gcc aag ccc aag acc get ggc etc aag ctg 1824 

Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly Leu Lys Leu 

595 600 605 

gcc ggt gag gcc tec acc acc teg acc teg gag aac ggc get gcc tec 1872 

Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly Ala Ala Ser 

610 615 620 

aac ggc aac ggc aac ggt gcc teg gcc tec aag acc teg get gcc aag 1920 

Asn Gly Asn' Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser Ala Ala Lys 

625 630 635 640 



ccc ctg gtc tec gcc gcc acc cgc aag tec gcc 
Pro Leu Val Ser Ala Ala Thr Arg Lys £er Ala 
645 650 



1953 
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<210> 5 
<211> 651 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii and coding for the mature 
GBSSI protein 

<400> 5 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 " 25 30 

Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 ' 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 

165 170 175 

,i 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 
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Glu Gly He Val 
275 

Asp Lys Phe Leu 
290 

Lys Ala Ala Ala 
305 

Asp Pro Thr Ala 



Lys Gly Val Asp 
340 

Pro Lys Val Gin 
355 

Lys Leu Val Asn 
370 

Val Val Lys Phe 
385 

Asp Phe Met Leu 



Leu His Ala Met 
420 

Gly Leu Val Asp 
435 

Ala Leu Asn Pro 
450 

Ala Thr Val Arg 
465 

Glu Met Val Ala 



Ala Gin Lys Trp 
500 

Gly Val Ala Thr 
515 

Lys He Pro Gly 
530 

Lys Pro Val Ser 
545 

Val Gly Thr Thr 



Ser Gly Pro Ser 
580 



Asn Gly Met Asp 
280 

Ser Ala Pro Tyr 
295 

Lys Glu Ala Leu 
310 

Pro Leu Phe Ala 
325 

lie He Leu Ala 



He Ala He Leu 
360 

Ala He Gly Thr 
375 

Ser Ala Pro Leu 
390 

Val Pro Ser Arg 
405 

His Tyr Gly Thr 



Thr Val Lys Glu 
440 

Asp Lys Leu Asp 
455 

Arg Ala Ser Glu 
470 

Asn Cys He Ser 
485 

Glu Gly Leu Leu 



Ala Lys Lys Glu 
520 

Asp Leu Pro Ala 
535 

Ala Ser Val Glu 
550 

Ala Pro Ala Met 
565 

Pro Ala Ala Ala 



He Glu Glu Trp 



Asp Gin Asn Ser 
300 

Gin Ala Glu Leu 
315 

Phe He Gly Arg 
330 

Ala Leu Pro Lys 
345 

Gly Thr Gly Lys 



Lys Tyr Lys Gly 
380 

Ala His Met Leu 
395 

Phe Glu Pro Cys 
410 

Val Pro Val Val 
425 

Gly Val, Thr Gly 



Glu Ala Asp Ala 
460 

Val Phe Ala Gly 
475 

Gin Asp Leu Ser 
490 

Glu Glu Val Val 
505 

Glu He Lys Val 



Val Ser Tyr Ala 
540 

Gly Asn Gly Ala 
555 

Gly Ala Trp Arg 
570 

Thr Pro Lys Val 
585 



Asn Pro Lys Thr 
285 

Val Tyr Ala Gly 



Gly Leu Pro Val 
320 

Leu Glu Glu Gin 
335 

He Leu Ala Thr 
350 

Ala Ala Tyr Glu 
365 

Arg Ala Lys Gly 



Thr Ala Gly Ala 
400 

Gly Leu He Gin 
415 

Ala Ser Thr Gly 
430 

Phe His Met Gly 
445 

Asp Ala Leu Ala 



Gly Arg Tyr Pro 
480 

Trp Ser Lys Pro 
495 

.i 

Tyr Gly Lys Gly 
510 

Pro Val Ala Glu 
525 

Pro Asn Thr Leu 



Ala Ala Pro Lys 
560 

Ala Thr Thr Pro 
575 

Thr Thr Tyr Lys 
590 
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Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly Leu Lys Leu 
595 600 605 

Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly Ala Ala Ser 
610 615 620 

Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser Ala Ala Lys 
625 630 635 640 



Pro Leu Val Ser Ala Ala Thr Arg Lys Ser Ala 
645 650 



<210> 6 
<211> 1314 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (1) . . (1314) 

<400> 6 

gcg ctg gac ate gtg atg gtt get get gag gtc gec cct tgg tec aag 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 

1 5 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg acc att gec cct cgc tac gac cag tac 
Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gee tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

gac cac ccc tgg ttc ctg gee aag gtc tgg ggc aag acc ggc tec aag 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gee ctg ttc tgc aag gee get att gag get gee cgc gtg ctg ccc 
Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
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115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gcc aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 • 140 

gcc ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc acc aag gcc aag teg gtg ctg get ate cac aac ate gcc ttc cag 528 
Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn He Ala Phe Gin 
165 170 175 

ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gcc 576 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

gcc ttt gac aag ctg gcc ttc teg gac ggc tat gcc aag gtt tac act 624 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

gag gcc acc ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

acc tac aag aag ate aac tgg ctg aag ggt ggc att ate gcc gcc gac 720 
Thr Tyr Lys Lys He Asn Trp Leu Lys Gly Gly He He Ala Ala Asp 
225 230 235 240 

aag ctg gtg act gtg teg ccc aac tac gcg acc gag ate get gcc gat 768 
Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu He Ala Ala Asp 
245 250 255 

gcc gcc ggc ggt gtg gag ctg gac acc gtc ate cgc gcc aag ggc att 816 
Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg Ala Lys Gly He 
260 265 270 

gag ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 864 
Glu Gly He Val Asn Gly Met Asp He Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gcc ggc 912 
Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

aag gcc gcc gcc aag gag gcc ctg cag gcc gag ctg ggc ctg cct gtg 960 
Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

gac ccc acc gcc ccc ctg ttc gcc ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe He Gly Arg Leu Glu Glu Gin 
325 330 335 

aag ggt gtg gac ate ate ctg gcc gcc ctg ccc aag ate ctg gcc acc 1056 
Lys Gly Val Asp He lie Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 350 

ccc aag gtg cag ate gcc ate ctg ggt acc ggc aag gcc gcc tac gag 1104 
Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 
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aag ctg gtg 
Lys Leu Val 
370 

gtg gtc aag 
Val Val Lys 
385 

gac ttc atg 
Asp Phe Met 



ctg cac gcc 
Leu His Ala 



ggc ctg gtc 
Gly Leu Val 
435 



aac gcc ate 
Asn Ala lie 



ttc teg gcg 
Phe Ser Ala 
390 

ctg gtg ccc 
Leu Val Pro 
405 

atg cac tac 
Met Mis Tyr 
420 

gac acc gtc 
Asp Thr Val 



ggc acc aag 
Gly Thr Lys 
375 

ccc ctg gcg 
Pro Leu Ala 



teg cgc ttc 
Ser Arg Phe 



ggt acc gtg 
Gly Thr Val 
425 



tac aag ggc 
Tyr Lys Gly 
380 

cac atg etc 
His Met Leu 
395 

gag ccc tgc 
Glu Pro Cys 
410 

ccc gtg gta 
Pro Val Val 



cgc gcc aag 
Arg Ala Lys 



acc gcc ggc 
Thr Ala Gly 



ggc ctg ate 
Gly Leu lie 
415 

gcc tec acc 
Ala Ser Thr 
430 



ggc 1152 
Gly 



gcc 1200 

Ala 

400 

cag 1248 
Gin 



ggc 1296 
Gly 



1314 



Is*: 
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<210> 7 
<211> 438 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii 

<400> 7 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 75 - 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 ' 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 * 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 . 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 
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Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165- 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys :Ile Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 

Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 350 

Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 1 430 

Gly Leu Val Asp Thr Val 
435 



<210> 8 
<211> 1593 
<212> DNA 
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<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii 

<220> 
<221> CDS 
<222> (1) . . (1593) 

<400> 8 

gcg ctg gac ate gtg atg gtt get get gag gtc gec cct tgg tec aag 48 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 

1 : 5 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg acc att gec cct cgc tac gac cag tac 144 
Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gec tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
I s * 65 70 75 80 

=3 gac cac ccc tgg ttc ctg gec aag gtc tgg ggc aag acc ggc tec aag 288 

Sb* Asp His Pro Trp Phe Leu "Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 

U 85 90 95 

IS ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 

*;r: Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 

^ 100 105 110 

ttc gee ctg ttc tgc aag gec get att gag get gee cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gee aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 



!□ 
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gee ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 



480 



ttc acc aag gec aag teg gtg ctg get ate cac aac ate gec- ttc cag 
Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 



528 



ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gec 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 , 190 



576 



gec ttt gac aag ctg gec ttc teg gac ggc tat gee aag gtt tac act 624 
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Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

gag gcc acc ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

acc tac aag aag ate aac tgg ctg aag ggt ggc att ate gcc gcc gac 720 
Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

aag ctg gtg act gtg teg ccc aac tac gcg acc gag ate get gcc gat 768 
Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

gcc gcc ggc ggt gtg gag ctg gac acc gtc ate cgc gcc aag ggc att 816 
Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 # 265 270 

gag ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 864 
Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gcc ggc 912 
Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

aag gcc gcc gcc aag gag gcc ctg cag gcc gag ctg ggc ctg cct gtg 960 
Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

gac ccc acc gcc ccc ctg ttc gcc ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

aag ggt gtg gac ate ate ctg gcc gcc ctg ccc aag ate ctg gcc acc 1056 
Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 350 

ccc aag gtg cag ate gcc ate ctg ggt acc ggc aag gcc gcc tac gag 1104 
Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

aag ctg gtg aac gcc ate ggc acc aag tac aag ggc cgc gcc aag ggc 1152 
Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

gtg gtc aag ttc teg gcg ccc ctg gcg cac atg etc acc gcc ggc gcc 1200 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

gac ttc atg ctg gtg ccc teg cgc ttc gag ccc tgc ggc ctg ate cag 1248 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 

ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

ggc ctg gtc gac acc gtc aag gag ggc gtc acc ggc ttc cac atg ggc 1344 
Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
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435 440 445 

gcc ctg aac ccc gac aag ctg gac gag get gac gec gac gec ctg gec 1392 
Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 1440 
Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tec tgg tec aag ccc 1488 
Glu Met Val Ala Asn Cys lie Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 
Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

ggc gtg gcc acc gcc aag aag gag gag ate aag gtg ccc gtt gcc gag 1584. 
Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val Pro Val Ala Glu 
515 520 525 

aag ate ccc 1593 
Lys lie Pro 
530 



% t| <210> 9 

<~j <211> 531 

il <212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii 



5 
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<400> 9 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
1 5 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 ' 125 



Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 



4 * 



130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 - 215 220 

Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 . ■ 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 

Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 - 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu" Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 ' 320 

Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 350 

Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 ' 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 

Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 



Ala Thr Val 
465 

Glu Met Val 

Ala Gin Lys 

Gly Val Ala 
515 

Lys lie Pro 
530 



I el 
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Arg Arg Ala Ser 
■4.7 0 

Ala Asn Cys lie 
485 

Trp Glu Gly Leu 
500 

Thr Ala Lys Lys 



Glu Val Phe Ala Gly 
475 

Ser Gin Asp Leu Ser 
490 . 

Leu Glu Glu Val Val 
505 

Glu Glu He Lys Val 
520 



Gly Arg Tyr Pro 
480 

Trp Ser Lys Pro 
495 

Tyr Gly Lys Gly 
510 

Pro Val Ala Glu 
525 



